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METHOD AND DEVICE FOR MONITORING AND FAULT DETECTION - 
IN INDUSTRIAL PROCESSES 

Technical field 



The present invention relates to a method and a device of monitoring and fault de- 
tection in industrial processes. More specifically, the present invention relates to a 
method of applying multivariate techniques in the sequential transfer of quality pa- 
rameters by means of score values and monitor the process with early fault detec- 
tion. 

Technical background 



In today's world of outsourcing, many of the steps in manufacturing process are 
actually made by other companies than by the company responsible for the final 
15 product Examples of products assembled in such a way include cars, computers, 
and telephone exchanges. The same approach applies to a product manufactured in 
several steps without assembly, e.g., a pharmaceutical tablet, a roll of printing paper, 



or a wafer in a semiconductor process. Sequential manufacturing makes for a strong 
need of tracking quality data through the whole manufacturing tree, assuring that all 
20 components and sub-components, as well as their combinations, have adequate 

quality, faults are early (high up) discovered in the tree, etc. When multiple data are 
measured in process steps and component quality testing, only multivariate tools can 
adequately use the information to evaluate the quality in the tree of manufacturing 
steps. 

25 

The products of today must meet increasing quality demands. The product quality is 
measured by many parameters and depends on many process variables in each step 
of the process chain. During each manufacturing step, process data are measured at 
certain intervals to control and monitor the process, and after each step intermediate 
30 quality data are measured on the sub-components and components, and then final 
quality data are measured on the final product However, even if many quality 
measurements are used, with reliance on traditional Statistical Process Control 
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(SPC) the small tolerated variation of each component leads to major difficulties, 
and rejection of a portion of well working product 

Several techniques are used for the purpose of monitoring a process. Parameters to 
5 check include quality, yield, energy, product rejection, etc. A conventional ap- 
proach for monitoring a process is to consider one variable at a time (univariate SPC 
). This approach is not adequate for obtaining the best quality, economy, etc. of a 
product in a manufacturing process, since actually several variables are involved. 

10 The most commonly employed type of SPC uses single variable control charts. 
When a given product or process is outside of a specification it is indicated. The 
limitation with SPC is that only few variables, generally at the most around 5, can 
be used for monitoring the process. 

15 The quality of intermediate and end products is in most cases described by values of 
a set of variables, the product specification. The specifications are often used in a 
univariate mode, i.e. they are checked individually for conformation within the 
~ speciS^Ionvaiue i^ge?Tlns^vS13se to both false negative and false posiHve 
classification, since the quality variables very rarely are independent in practice, but 

20 are treated as if they were, i.e. univariately. 

It is possible to use traditional SPC to establish when a process is out of specifica- 
tion when only a few variables are involved. However, when the number of process 
variables and quality increase or when they interact, problems arise. Very often it is 
25 difficult to determine the source of the problem, particularly when the number of 
process variables increases. Product quality is typically a multivariate property and 
must be treated as such in order to monitor a process in that respect 

In order to optimize and control a process with several variables, projection tech- 
30 niques such as Principal Components Analysis (PCA) and Projection to Latent 

Structure (PLS) have been applied These techniques are well described (Mac Gre- 
gor et al.) and further development has been made to address the process control 
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need of today. S.Wold et ah (Hierarchical multi-block PLS and PC models, for eas- 
ier interpretation, and as an alternative to variable selection. J.Chemometrics 10 
(1996) 463-482), describes a method where the variables are divided into conceptu- 
ally meaningful blocks before applying hierarchical multi-block PLS or PC models. 
5 This allows an interpretation focused on pertinent blocks and their dominant vari- 
ables. Such blocking can be used in process modeling and modeling. 

Attempts based on SPC and projection techniques have been made to control a pro- 
cess. WO 99/19780 describes a method and device for controlling an essentially 

10 continuous process comprising at least two sub-processes, which minimizes the re- 
jection of the produced product The method is based upon combining multivariate 
models with a processed variable value. A variable value for a subsequent second 
sub process is predicted based on the combination of the multivariate model and the 
processed variable value. However, the method only utilizes multivariate data 

1 5 analysis in respect of controlling the process and not for checking or monitoring. 
Furthermore the method is applied only for a specific application and can not be 
use^jn general app lications. 

C. Wikstrom et ah (Multivariate process and quality monitoring applied to an elec- 
20 trolysis process. Part 1. Process supervision with multivariate control charts Che- 

mometrics and intelligent laboratory syst em 42 (1998 ) 221-23 l), describesJMultL: 

variate Statistical Process Control (MSPC) applied to an electrolysis process and the 
benefit with multivariate analysis over traditional univariate analysis is discussed 
Moreover the article shows how the result from a multivariate principal component 
25 analysis method can be displayed graphically in multivariate statistical control 

charts. By using this informationally efficient MSPC approach, rather than any inef- 
ficient SPC technique, the potential of achieving major improvements in the under- 
standing and monitoring of the process is shown. The improvements are, however, 
not sufficient to be able to control the quality problems in complex processes unless 
30 specific experimentation has been made to make the multivariate model invertable 
and thus also capable to determine how the process should be modified to minimize 
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deviation from the specification profile. In MSPC as well as SPC "controlling" 
should be synonymous with "checking" or "monitoring". 

Another drawback with the application of prior art to sequential manufacturing is 
5 the need to cany relevant information from different process steps in a process 
chain, which can not be easily achieved by known techniques. Therefore, a need 
exists to describe product quality in a sequential monitoring process by a multivari- 
ate model of the relevant quality variables rather than the individual variables them- 
selves. 

10 

A problem with prior art (univariate SPC) is that the quality variables are not inde- 
pendent, but their interdependencies get lost if they are analyzed or monitored indi- 
vidually. The risk for false product approval increases and when this occurs and is 
fed back in the supply chain the specification intervals are usually narrowed in order 
15 to secure product However, this rarely eliminates the problem of false product ap- 
proval and also give rise to substantial false rejects. 



Summary of invention 

20 The present invention is directed to overcoming the problems set forth above espe- 
cially to monitor and detect faults at the earliest possible stage in a process chain. 

This is accomplished according the invention by a method for monitoring of and 
fault detection in an industrial process, comprising at least a first sub-process and at 

25 least one second sub-process arranged in a process chain, comprising, for the at least 
one second sub-process the steps of collecting data and calculating a multivariate 
sub-model based on the collected data, said method being characterized by the steps 
of receiving in the first sub-process from the at least second sub-process information 
related to the multivariate sub-model calculated for the at least second sub-process, 

30 collecting data related to the first sub-process, and calculating a multivariate sub- 
model for the first sub-process based on collected data and received information. 
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The current invention provides a method and a device for multivariate quality as- 
sessment and sequential transport of quality measures between subsequent produc- 
ing entities. Early fault detection in the process chain is achieved by fault diagnosis 
tools such as outlier detection etc., and on-line monitoring of new data measured 
5 from the beginning of the manufacture chain. This methodology has several advan- 
tages compared to traditional univariate quality assessment. The risk for false prod- 
uct approval decreases. Multivariate quality assessment allows the number of qual- 
ity variables to increase and still be meaningful as product specification. This is due 
to the inherent capability of multivariate techniques to reduce dimensionality with- 
10 out losing relevant information in the data. This allows for instance inclusion of en- 
tire spectra from spectroscopic measurements to be used as quality assessment. Near 
Infra Red (NTR) sepctroscopy has become increasingly used for this purpose. 

It is also possible to predict product properties directly from the process, i.e. to 
15 translate die production state into predicted product properties. This is sometimes 
called soft sensors. 

ctcwcot In.midtivariate_quahtyiassessment_fo^ sam ples are 

used to build a multivariate model using PCA or PLS. Each produced unit is then 
projected on this model and classified according to its dissimilarity to the model. 

20 

The current invention use multivariate models in sequenc e. The producin gentitv 

applies a multivariate model for describing the quality. This model consists of a 
number of latent variables, the same latent variables are used by the receiving entity 
as a means of checking the quality of each delivered unit. But they can also be used 

25 as X-variables in the receiving entity's process step to give quality assessment of 
each individual step taken into use into the further processing. 

The data are arranged in a dynamically updated tree with the root being the final 
product data and each branch and twig being the data of a component or sub- 
30 component 
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This feature concerns a novel method to analyze multivariate data from a tree 
structured manufacturing process. In a preferred embodiment this novel method 
gives information about each manufacturing step and its results, as well as the com- 
bined influence of all upstream ("up-tree") steps on later steps and on the final 
5 product Preferably the novel approach allows the detection of faults and upsets in 
separate steps as well as in combinations of several or all steps, and points to which 
process variables and steps that together are related to these faults. Thus, an over- 
view of all the process data are obtained in a way corresponding to the structure of 
the process, providing information about the overall quality as well as the quality of 
10 each individual step. 

Preferably this novel approach provides an adequate infrastructure to monitor a 
complicated chain of components manufacturing and assembly in such a way that 
the adequate quality of the final product is assured; Pertinent graphics are included, 
15 showing the status of the overall chain as well as parts and individual steps in terms 
of one-, two-, and three-dimensional multivariate control charts. Additional multi- 
variate plots are included for the detailed interpretation of the patterns seen in these 

20 We here discuss the novel approach as applied to the discrete manufacturing of a 
product consisting of components and sub-components, e.g., a computer, a tele- 
phone, a car, or a camera. Precisely the same approach applies also to a product 
manufactured in several steps without assembly, e.g., a pharmaceutical tablet, a roll 
of printing paper, or a wafer in a semiconductor process. 

25 

Brief description of the drawings 

Figure 1 shows the information flow in a simple straight chain sequence and quality 
dataY. 

30 

Figure 2 shows the information flow in a simple example of a "forked" sequence. 
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Figure 3 shows PLS regression coefficients (b) for step 1 1, yl 1 (left), xl 1 and xl2 
are seen to dominate both y-models, and xl4 is unimportant. 

Figure 4 shows PLS regression coefficients (b) for step 11, yl2, xl 1 and xl2 are 
5 seen to dominate both y-models, and xl4 is unimportant 

Figure 5 shows the data structure for a simple process with four steps with two 
merging branches in the final step. 

10 Definitions 

An "industrial process" means a process comprising one or several steps performed 

within that specific process or one or several steps performed at sub processes, and 

if 

could be described as a multivariate process. 

Bold characters are used for vectors and matrices, lower case for vectors, e.g., u, 
and upper case fo r matrice s, e.g., X. 

The process consists of J steps (j= 1 , 2, . . . , J), where we assume that the last (J) is the 
20 final step, giving the final product. 

One "NEPALS-PLS round" means from present u's (local and from chain below), 
use an average of these u's, multiply that into X to give w% normalize w, calculate t 
and X w, and finally c as t'Y/(t't). 

25 The training set is the historical data measured on all steps from instances of accept- 
able product, N observations with the individual index i (i=l,2,. . .,N), these data will 
be used to develop a model of how data in all steps and in their combination should 
look for acceptable product 

30 On each step (j), multiple process variables (xjk) are measured, with the value of the 
individual observation being x^. For step j, the training set data comprise the matrix 
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Optionally, after each step (j), quality variables (y jm ) are measured, with the value of 
the individual observation being y 8m . For step j, the training set data comprise the 
matrix Y,. For steps lacking quality data (y), principal components like calculations 
will be made, for steps having quality data PLS like calculation will be made. 

A coefficient b jk shows how much a variable^ contributes to the model of step 1 1. 

A sub-process is a process step in a straight process chain, a forked process chain or 
a combination of both. 

c .■ 

A sub-model is a multivariate model calculated for a specific subprocess, which can 
include information from one or several sub-models. c 

Outliers could be samples that are out of a specification, outside a process control 
limit, etc.. 



The items displayed in a score plot are the score values of the observations for a 

sifted 3 



Knt^mtodeITihueiisions)7otten components one and" 
two, or one, two, and three. The score values are weighted averages of all variables 
(measurements, properties) with the weights determined by (a) the loadings of the 
variables of the specified model components, and (b) the scaling parameters of the 
corresponding model. Thus, the scores are new variables that constitute summaries 
of the original variables. Deviations in the score values of an observation from the 
normal intervals of these scores indicate the observation to be different from the 
ones situated within the normal score intervals; it is an outlier. 



A scaled raw data (contribution) plot for an observation, shows the difference be- 
tween its individual variable values in scaled form and the corresponding scaled 
values of a reference observation. This reference observation is often the (hypotheti- 
cal) average of all "good" observations (giving good final quality) in the training 
set, or the set point of manipulated variables together with the average values of the 
other variables. The scaled raw data (contribution) plot indicates which combination 
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of variables that is related to die significant deviation of the observation (makin g it 
an outlier) - these are the variables deviating the most from their "nonnal" values. 
Thus the scaled raw data (contribution) plot is useful for finding the reason for an 
observation deviating from the "normal" ones, as seen in, for instance, a score plot 
or a residual std dev (DModX) plot 

A residual std dev (DModX) plot describes how much the scaled data vector of each 
observation differs from its "ideal", this being the model value of the observation, 
i.e., the weighted sum of the model loadings with the weights being the score values 
of the observation. A large residual std dev (DModX) value, significantly outside its 
normal range, indicated that this observation deviates from the model of the normal 
observations, it is an outlier. The indications for process upsets are often seen first in 
the residual std dev (DModX) plot 



Detailed description of the invention 

In the following the invention will be described in more detail by means of exam- 



fflirwBic^ 
the scope of the invention. 

Like any modeling, sequential multivariate modeling is made in two phases; 

(1) a "training" phase based on historical data leading to a multivariate model. In 
the present case this consists of, a chain of multivariate submodels. 

(2) a "prediction" phase where the model from phase (1) is used to evaluate new 
incoming data to detect deviations from "normal operation", and predict 
properties of the resulting product or intermediate product of a subprocess. 

In the present case of sequential modeling, historical data are collected for each step 
in the manufacturing process chain, and a sub-model is developed for each such 
step. The models are then connected in a chain corresponding to the process chain 
(this connection can be done in several slightly different ways), and in the prediction 
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step applied on-line to production data for diagnosis and quality assessment and 
prediction of quality further down the chain. 

Early fault detection in the process chain is achieved by fault diagnosis tools such as 
5 outlier detection etc. Plots such as contribution, score, coefficients etc. can be used 
as diagnosis tools. On-line monitoring of new data measured from the beginning of 
the manufacture chain, give information about the status of the process chain in total 
as well as each of it's steps, and identifies the from normal process operation in out- 
liers in scores and other diagnostic plots. 

10 

Examples 

In the following two small examples we will indicate how information from one 
step of a sequential process can be carried "down Stream" by means of scores of a 
15 multivariate model of the step. Subsequent steps will use these scores as variables 
together with the process variables in those subsequent steps. 

Lathis way one gets a monitoring^pproacKTwffich a) eacK15Sefme3iate product 
meets specification and b) ensures that all intermediate products from different pro- 
20 cess streams fit together in intermediate and final products. In addition to process 
monitoring and only fault detection, this can be used, for instance, to match inter- 
mediates when the intermediate product differ in size, large ones go together, and 
small ones go together or, just to make sure that the final product has all properties 
in the right proportion, including those of its components. 

25 

To illustrate the models, information flow, and algorithms, two small illustrative ex- 
amples are given below with only 5 observations and three steps in each. 

The simplest sequential algorithm is used for the illustration, where separate stan- 
30 dard two block (X,Y) PLS models (no hierarchical structure) are made for the "left 
end" blocks of each chain, i.e., block (XI, Yl) in Figure 1, and blocks (X4, Y4) and 
(X5, Y5) in Figure 2. Thereafter the scores (tl 1, tl2, t41, t42, t5) of these blocks are 



WO 2004/003671 PCT/SE2003/001 134 

11 

included as extra process variables in the blocks (X2, X6) next to the right (down 
stream) of these "end blocks", and PLS models made of these blocks. The resulting 
scores are then used as extra process variables in the blocks next to the right (down 
stream), etc., until the end. 

5 

This simplest way of calculation gives two PLS-components for stepl 1 in Figure 1, 
the same for step 21 in Figure 2, and one component for step 22 in Figure 2. 

Referring to Figure 1, a straight chain sequence in three steps is shown. We may 
i 0 think of this as a very simple example of a pharmaceutical manufacturing, where a 
tablet is made in three consecutive steps; granulation (step 11), mixing (step 12), 
and tabletting (step 13). The three model properties are shown in table 1. 

Table 1 

15 





Model 


Number of vari- 
ables (X*s) 


Number of t's from 
previous step 


Number of measured 
quality variables (Y's) 


Resulting number 
of PLS components 




step 11 


4 




2 


2 




step 12 


3 


2 


1 


3 




step 13 


3 


3 


2 


3 



Step 11, this first step simulates a simplified granulation with four process variables 
(e.g., temperature (xTl), flow (xT2), concentration (xI3), spray pressure (xl4J) and~~ 
two measured quality variables (responses), e.g., granulate particle size (yl 1) and 
20 homogeneity (yl2) (standard deviation of particle size). The data values below have 
been centered (average subtracted) and scaled, to make them uninteresting as such. 

Table 2 shows data Xl=[xll,xl2, xl3,xl4] and Yl=[yll, yl2] of step 11, together 
with resulting two score vectors tl 1 and tl2, and PLS coefficients w* and c. In the 
25 PLS model, the scores are calculated from the raw data as: tj a = 5^. X& w^*, and each 
y-vector is modeled by the scores ast y^ — - 2 a t^ + (residuals), or, equiva- 
lently, y^ = 2^ x^ b^ + 4n • The data are analyzed in original form, with no addi- 
tional centering or scaling. 
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obs / vec 


xll 


xl2 


xl3 


xl4 




yll 


yl2 




til 


tl2 


1 


0.188 


0.076 


0.175 


0.091 




0.248 


0.191 




0.280 


0.062 


2 


-0.316 


0.029 


-0.367 


-0379 




-0314 


-0.250 




-0.553 


0.143 


3 


0.599 


0.023 


0.561 


0.591 




0.624 


0.563 




0.951 


-0.127 


4 


-0.033 


0312 


-0.213 


-0335 




0.028 


0.139 




-0.175 


0.435 


5 


-0.438 


-0.441 


-0.156 


0.032 




-0.587 


-0.644 




-0.503 


-0.513 
























6 


0.63 


0.061 


0.587 


0.507 










0.963 


-0.047 


7 


0.83 


0.001 


0.9 


0.807 










1.366 


-0.212 
























w* and c 1 


0.699 


0.295 


0.514 


0.401 




0.717 


0.653 








w* and c 2 


0.368 


0.813 


-0.139 


-0.487 




0.415 


0.600 








byll 


0.653 


0.549 


0311 


0.085 














byl2 


0.677 


0.680 


0.253 


-0.031 















Data (XI and Yl), scores (tl 1 and tl2), and model coefficients (w*, c, and b) of 
step 1 1 are shown in table 2. Observation 6 and 7 constitute the "prediction set", 
5 which is not used for model development, but rather to simulate the "on-line" 
monitoring of the process at a later stage. 



The regression coefficient plot of y 1 1 shown in Figure 3, indicates that variable xl 1 
and xl2 dominate the yl 1 -model and that xl4 is unimportant The regression coef- 
10 ficient plot of y 1 2 shown in Figure 4, indicates that variable xl 1 and xl2 dominate 
the yl2-model and that xl4 is unimportant Further the variable xl3 is seen to be 
about half as important as the dominating xl 1 and xl2 in both y-models. 

Step 12, in this simulated mixing step, has three process variables, feed rate of the 
15 constituents (x21), stirring rate (x22), and mixing time (x23). There is one y- 

variable, the resulting homogeneity (y2). As in step 1 1, the data are centered and 
scaled to make them uninteresting as such. 



20 



In this step, the influence of step 1 1 is modeled by means of the two scores resulting 
in the step 11 model These two vectors (til and tl2) are appended to the X2-matrix 
of step 12 to give totally five variables (x21, x22, x23, tl 1, and tl2) in the X-matrix 
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of step 12. The PLS analysis of these data gives three components, the scores of 
which are used as additional variables in step 13. Step 12 data (X2 augmented with 
the two scores of step 11, and Y2), scores (t21, t22, and t23), and Y = Y2, and mod- 
els coefficients (w*, c, and b), are shown in table 3. 

5 



Table 3 



obs / vec 


x21 


x22 


x23 


til 


tl2 




y2 




t21 


t22 


t23 


1 


-0.364 


-0.355 


-0.287 


0.280 


0.062 




-0.336 




-0.339 


-0.278 


0.364 


2 


0.197 


0.096 


0.164 


-0.553 


0.143 




-0.250 




-0.121 


0.227 


-0.456 


3 


0.427 


0.454 


0.423 


0.951 


-0.127 




0.979 




1.091 


-0.347 


-0.026 


4 


-0.358 


-0.279 


-0.395 


-0.175 


0.435 




-0.840 




-0.726 


-0.334 


-0.104 


5 


0.098 


0.084 


0.095 


-0.503 


-0.513 




0.447 




0.095 


0.731 


0.221 


























w* and c ] 


0.448 


0.436 


0.453 


0.481 


-0.416 




1.005 










w* and c 2 


0.064 


0.049 


0.091 


-0.568 


-0.830 




0.340 










w* and c 3 


-0.409 


-0.282 


-0.303 


0.305 


-0.909 




0.380 










b y2 


0.317 


0.347 


0.371 


0.406 


-1.046 















Step 13, in this simulated tabletting step, there are three process variables, punching 
— "^j^55u5e^3 ^^l77nScKi55"speedlS2), and filiSLgTateXSljrTEere are two ^r™^™* 
10 variables, the resulting tablet hardness (y3 1) and uniformity (y32). As previous 

steps, the data are centered and scaled. 



In this step, the influence of step 12 (and indirectly of step 1 1) is modeled by means 
of the three scores resulting in the step 12 model. These three vectors (t21 9 122, and 
15 t23) are appended to the X3-matrix of step 13 to give totally six variables (x3 1, x32, 
x33, t21, t22, and t23) in the X-matrix of step 13. 

The PLS analysis of these data gives three components, denoted t3 1, t32, and t33 
below. Step 13 sq data (X3 augmented with the three scores of step 12, and Y = 
20 Y3), and model coefficients (w*, c, and b), are shown in table 4. 
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Table 4 



obs / vec 


x31 


x32 


x33 


t21 


t22 


t23 




y31 


y32 




t31 


t32 


t33 


1 


0.409 


-0.120 


0.180 


-0.339 


-0.278 


0.364 




0.633 


-0.128 




-0.105 


0.526 


0.499 


2 


-0.098 


0.086 


-0.013 


-0.121 


0.227 


-0.456 




-0.446 


-0.189 




-0.179 




-U.4/0 


3 


0.099 


0.479 


0.301 


1.091 


-0.347 


-0.026 




0.411 


1.310 




1.269 


-0.047 


-0.U3y 


4 


-0.156 


0.075 


-0.086 


-0.726 


-0.334 


-0.104 




0.422 


-0.875 




-0.544 


/\ C A*l 

0.547 


-O.Zzo 


5 


-0.254 


-0.520 


-0.382 


0.095 


0.731 


0.221 




-1.020 


-0.117 




-0.441 


-0.874 


1 A A 

0.244 






























w* and c 1 


0.178 


0.337 


0.279 


0.835 


-0.2oU 


U.04y 






1 f\AA 










w* and c 2 


0.272 


0.264 


0.262 


-0.489 


-0.736 


0.081 




1.102 


-0.349 










w* and c 3 


0.376 


-0.397 


0.041 


0.094 


-0.061 


0.836 




0.286 


0.247 










by31 


0.470 


0.297 


0.399 


-0.217 


-0.927 


0.345 
















by32 


0.183 


0.161 


0.210 


1.065 


-0.049 


0.230 

















Figure 2 has a "fork", where the last step receives material (and information) from 
two chains, each with just one step. This may be seen as a simplified version of the 
manufacturing of a small computer, putting together two components, say a mother 
board and a power source, as a last step. Here, in addition to the training set of five 
observations, a prediction set with two additional observations is used to show the 



Note that step 21 is identical to step 1 1 in Figure 1 above, and hence not shown 
again below. The three models have the following properties, shown in table 5. 



Table 5 



Model 


Number of 


Number of t*s from 


Number of measured 


Resulting number of 




variables (X's) 


previous step 


parameters (Y's) 


PLS components 


step 21 


4 




2 


2 


step 22 


3 




1 


1 


step 23 


3 


3 


2 


3 



Step 21 in Figure 2 is simulated to have the same data as step 11 above shown in ta- 
ble 2, Le. X1=X4, Y1=Y4, tl l=t41 , and tl2=t42, and is hence not further discussed. 
Of course the variables here (for a computer motherboard manufacturing) are differ- 
ent from those of a pharmaceutical granulation, but after centering and scaling they 
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here are identical in values to simplify the examples. Hence also the PLS model of 
this step is identical to that of step 1 1 . 

Step 22 in Figure 2, here the X-data have the same number of variables and the 
5 same values as step 12 in Figure 1 . However, since there is no step preceding step 
22 in the forked model, the PLS model is not identical, and hence the table of data 
and results is given below. Only one PLS component is significant 

Step 22 data (X5 and Y5), score (t5), and model coefficients (w*, c, and b). Both the 
10 prediction set observations, 6 and 7, fit the model within the 5 % level, in table 6. 



Table 6 

obs / vec 



1 



w* and c 1 



x51 



-0.364 



0.197 



0.427 



-0.358 



0.098 



0.332 



-0.368 



0:600 



0.352 



x52 



-0.355 



0.096 



0.454 



-0.279 



0.084 



0.510 



-0.540 



0.548 



0.321 



x53 



-0.287 



0.164 



0.423 



-0.395 



0.095 



0.579 



-0.521 



0.584 



0.342 



y5 



-0.276 



0.153 



0.431 



-0.422 



0.113 



.0.587 



t5 



-0.580 



0.266 



0.752 



-0.598 



0.160 



0.816 



-0.820 



DModX 



3.260 



2.573 



PmodX 



0.048 



0.085 



Step 23, in this simulated manufacturing step comprise three process variables, three 
relative position measurements (x3 1, x32, and x33) of the motherboard and of the 
power source. There are two y-variables, two measured voltages (y3 1, y32) on the 
mounted computer. As previous steps, the data are centered and scaled. 



In this step 23, the influence of steps 21 and 22 is modeled by means of the three 
20 scores resulting in the step 21 and step 22 models. These three vectors (t41, t42, and 
t5) are appended to the X-matrix of step 23 to give totally six variables in X6 (x3 1 , 
x32, x33, t41, t42, and t5) of step 23. 
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The PLS analysis of these data gives three components, denoted t61, t62, and t63, 
and step 23 data (X6 and Y6), scores, and model coefficients (w*, c, and b) are 
shown below in table 7. The prediction set observation 7 fits the model, while no 6 
is a significant outlier as shown in table 7. In a DmodX-plot no 6 would be signifi- 
cantly outside its normal range, indicating that this observation deviates from the 
model of the normal observations and classified as an outlier 



Table 7 



obs/vec 


x61 


x62 


x63 


t41 


t42 


t5 


y61 


y62 


t61 


t62 


t63 


DmodX 


PModX 


; 1 


0,409 


-0.12 


0,18 


0,28 


0.062 


-0.58 


0,633 


-0,128 


0,05 


0,644 


0,427 






2 


-0.098 


0.086 


-O.013 


-0.553 


0,143 


0.266 


-0,446 


-0.189 


-0,312 


-0,28 


-0,36 






3 


0,099 


0,479 


0,301 


0.951 


-0,127 


0.752 


0,411 


1.31 


1,321 


-0.226 


-0,065 






4 


-0.156 


0.075 


-0.086 


-0.175 


0,434 


-0.598 


0.422 


-0.875 


-0,448 


0,587 


-0,299 






5 


-0.254 


-0.52 


-0,382 


-0.503 


-0.513 


0,16 


-1,02 


-0,117 


-0,611 


-0,725 


0.297 




















pred 

yi 


pred 
Y2 












6 


-0.353 


-0.999 


-0.683 


0,963 


-0.048 


0,816 


-0,573 


1.002 


0,494 


-0,78 


0,406 


5,555 


0,019 


7 


0 


-0.2 


-0.1 


1.366 


-0.213 


-0,82 


1,072 


0,358 


0,614 


0,81 


0,703 


4,143 


0,057 




w1 


w2 


w3 


w4 


w5 


w6 


d 


c2 












w* and c1 


0,189 


0,336 


0.282 


0.765 


-0,07 


0,427 


0,474 


0,929 












w* and c2 


0.22 


0.178 


0,188 


0.255 


0,496 


-0,757 


1.009 


-0,534 












w* and c3 


0.502 


-0,552 


0.061 


0.222 


-0.595 


-0.206 


-0,051 


0,313 












by61 


0.266 


0.367 


0.32 


0,609 


0.498 


-0,551 
















by62 


0.215 


0.044 


_0.18_1_ 


Jk643„ 





















With Figure 2, the two prediction set observations are then used in the prediction 
phase of the sequential analysis. The prediction set (observation 6 and 7) give the 
values of scores and residual standard deviations (DmodX) shown above. It is seen 
that observation 6 is rather far away from the model and is a significant outlier. 
With more variables it would definitely be even clearer. A contribution plot would 
show that the main variables deviating are x3 1 and x33, i.e., two of the step 23 pro- 
cess variables. 

Distances to the model (residual standard deviations) of the five training set obser- 
vations (1-5) and the two prediction set observations. The first one (no 6) is seen to 
be outside the critical limit, and hence classified as an outlier. 

Transmitting information and data between the different steps can be performed in 
different directions. In Figure 2, step 23 may also transmit information and data to 
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step 21 and/or step 22. Analogously in Figure 1, step 13 may also transmit informa- 
tion and data to step 12. 

In the following text the invention will further be described in detail. Data are di- 
5 vided into blocks for each step, with Xj indicating process data measured on stepy 
and Yj indicating quality and other result data measured on the same step. Figure 5 
shows the data structure for a simple process with four steps (71,72,73, and 74) with 
two merging branches in the final step 74. The scores t, (t7, t8, t9, and tlO) carry the 
information of X, (X7, X8, X9, and X1G) to later steps, and the scores u } (U7, U8, 
10 U9, and U10) summarize the Y-block (Y7, Y8, Y9, and Y10) of the same step. 

For simplicity we assume only one significant component for each step. In reality, 
of course they are usually more. 

15 Hence, the model estimate in phase one has a number of sub models, one for each 
step (71 to 74 in Figure 5), plus a mechanism for carrying the sequential transfer of 
quality parameters from a step forwards (down stream) in the chain. 

There are several possible variants of algorithms to estimate the sequential model 
20 from the training set 

A simple algorithm would be to start with a simple PLS model for each step being 
leftmost in a chain, above step 71 and step 73. The number of components in each 
step model (Aj) is determined so that the scores of each step (tj) adequately capture 
25 the systematic variation in the corresponding Xj matrices. These scores are then in- 
cluded among the X-variables in the next model to the right, either as just exten- 
sions of the X-block, or, as separate blocks, one per component Thus, in the second 
variant, if there were Aj components in the model to the left of the present step, 
there will be 1-hA, blocks in a hierarchical PLS model of the present step. 
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The resulting score(s) of the second step models (one per chain in this example) are 
then carrying the information to the next step to the right in the chain, etc., until ei- 
ther the process chain finishes, or merges with another chain as in Figure 5. 

5 In steps where several chains merge, the procedure is the same, except that a set of 
scores appear from each chain, and the extension of X includes all these scores, ei- 
ther just as additional variables, or, as a set of one variable blocks, Aj+A k in number, 
assuming that the previous models in two merging chains were Aj and A k , respec- 
tively. 

10 

The model is now finished, and can be used in a prediction phase (with additional 
variables, data etc.) starting with the earliest steps, and sequentially moving the re- 
sulting scores down the chain(s) together with the variables measured in each sub- 
sequent step. 

15 

In a more elaborate model, one can weave the estimation back and forth, using the 
model with only one component above as a start Then, in the 4C back-ward" phase, 

(final) Y-block. That will, to begin with, produce a u-vector for this final block, 
20 which then is carried backward to previous blocks together with its Y-summaries 
(uj), giving after one NBPALS-PLS round a joint u-vector for the previous step, 
which then is carried backwards another step, etc. Once the end is reached, one has 
t-scores for the leftmost step sub models, and the directions are changed to for- 
wards, etc., until convergence. 

25 

The X-blocks of all steps are then deflated by t*p' , and a second component started 
just as in any hierarchical PLS model. 

After an adequate number of model components (cross-validation or other estima- 
30 tion of model complexity), the model is finished, and can be used in a prediction 
phase starting with the earliest steps, and sequentially moving the resulting scores 
down the chain(s) together with the variables measured in each subsequent step. 



